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ABSTRACT 

This article investigates the optimization of sentiment analysis within online shopping. It begins by 
outlining the inherent challenges in sentiment analysis, particularly in handling large volumes of unstructured 
textual data and achieving accurate results amidst noise and context complexities. In response to these 
challenges, the proposed work introduces the utilization of Bayesian Optimization-inspired Random Forest 
(BO-RF) as a solution. This innovative approach aims to enhance sentiment analysis accuracy and efficiency 
by leveraging the combined strengths of Bayesian Optimization and Random Forest algorithms. The 
mechanism of the proposed work involves leveraging Bayesian Optimization to tune the hyperparameters of 
the Random Forest model efficiently. This process optimizes the model's performance, improving the 
accuracy of sentiment analysis tasks. BO-RF excels in feature selection, enabling the extraction of relevant 
information from text data while filtering out noise. Through comprehensive experiments and evaluations, 
the results demonstrate the superior performance of BO-RF compared to traditional sentiment analysis 
methods. The proposed approach achieves higher accuracy rates and greater efficiency in sentiment 


classification tasks, showcasing its potential to revolutionize sentiment analysis within online shopping. 
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1. INTRODUCTION 

Customer reviews play a pivotal role in 
shaping the online shopping experience. They 
provide valuable information beyond product 
descriptions, offering real-world perspectives from 
individuals who have already purchased. Positive 
reviews can instil confidence in potential buyers, 
while negative reviews can serve as warnings, 
guiding consumers away from potential pitfalls [1]. 
The transparency facilitated by customer reviews 
fosters a sense of trust between buyers and sellers, 
creating a virtual marketplace where informed 
decisions can be made. In addition to influencing 
individual purchasing decisions, customer reviews 
contribute to the overall reputation of products and 
sellers on online platforms [2], [3]. A high volume 
of positive reviews can enhance a product's visibility 
and credibility, ultimately driving sales. Managing 
and analyzing this influx of customer feedback 
presents its own set of challenges [4]. 


In the intricate landscape of business 
strategy, sentiment analysis serves as a sophisticated 
instrument, allowing companies to gain nuanced 
insights into customer perceptions. By scrutinizing 
diverse sources such as online reviews, social media 
interactions, and survey responses, organizations can 
distil valuable information regarding customer 


satisfaction, pinpoint areas for enhancement, and 
guide their decision-making with a data-centric 
approach [5]. However, the utility of sentiment 
analysis extends beyond its diagnostic capabilities, 
playing a pivotal role in brand management. In brand 
management, sentiment analysis becomes a 
proactive force [6]. Companies leverage this 
analytical tool to swiftly identify and address 
negative sentiments and_ strategically harness 
positive feedback to fortify their reputation in a 
fiercely competitive market. The ability to navigate 
the complex interplay of positive and negative 
sentiments allows businesses to mould their public 
image intentionally, fostering a more favourable 
perception among their target audience [7]. 


At the heart of sentiment analysis lies the 
indispensable role of machine learning, mainly 
through the avenue of supervised learning. This facet 
enables models to grasp the nuanced contextual 
intricacies inherent in language, navigating through 
layers of sarcasm, cultural subtleties, and diverse 
expressions with remarkable finesse [8]. 
Incorporating unsupervised learning techniques like 
clustering and topic modelling further refine 
sentiment analysis capabilities. By discerning 
intricate patterns within unlabeled data, these 
techniques contribute to a more comprehensive 
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understanding of the sentiment landscape [9], [10]. 
The synergy between machine learning and 
sentiment analysis is a testament to technological 
sophistication. This collaboration amplifies the 
precision of calculations and fortifies adaptability, 
ensuring that these analytical tools remain finely 
attuned to the ever-evolving dynamics of the digital 
communication landscape. In_ essence, the 
intersection of technology and human perception 
within sentiment analysis gives businesses a resilient 
and proactive framework for navigating the 
multifaceted currents of public opinion [11]. 


1.1. Problem Statement 

The challenge of contextual ambiguity and 
sarcasm complicates sentiment analysis, introducing 
uncertainties in accurately deciphering the 
sentiments embedded in user-generated content. 
Existing models often need to improve 
understanding of the subtle nuances and figurative 
language, resulting in misinterpretations. This 
problem significantly impacts the reliability of 
sentiment predictions, particularly in reviews where 
sarcasm and contextual intricacies are prevalent. 
Developing advanced natural language processing 
techniques to unravel the layers of linguistic 
complexity is imperative for overcoming this 
challenge and ensuring precise sentiment analysis in 
the face of contextual ambiguity and sarcasm. 


1.2. Motivation 

The motivation for investigating contextual 
ambiguity and sarcasm in sentiment analysis arises 
from the imperative to enhance the precision of 
sentiment predictions in user-generated content. 
Current models often need help deciphering subtle 
nuances and figurative language, leading to 
misinterpretations and compromised reliability. This 
research seeks to develop advanced natural language 
processing techniques that can unravel the layers of 
linguistic complexity, enabling sentiment analysis 
models to comprehend and accurately interpret 
contextual intricacies and sarcasm. The motivation 
stems from the aspiration to improve the practical 
applicability of sentiment analysis in real-world 
scenarios, where user expressions often entail 
diverse contextual nuances and figurative elements. 


1.3. Research Objective 

This research addresses contextual 
ambiguity and sarcasm in sentiment analysis by 
proposing and evaluating the Bayesian 
Optimization-based Random Forest (BO-RF) 
algorithm. The research objective is to enhance the 
precision of sentiment predictions in user-generated 
content, where existing models often need to 
improve in deciphering subtle nuances and 


figurative language. By leveraging the capabilities of 
BO-RF, we seek to develop advanced natural 
language processing techniques that can unravel the 
layers of linguistic complexity, enabling sentiment 
analysis models to interpret contextual intricacies 
and sarcasm accurately, thus improving the practical 
applicability of sentiment analysis in real-world 
scenarios. 


2. LITERATURE REVIEW 

"Masked Language Modelling" [12] 
leverages the techniques from masked language 
modelling. The model predicts affective tokens 
within the context of ABSA, allowing for a more 
nuanced understanding of sentiment expressions 
associated with specific aspects. It can capture the 
intricacies of sentiment nuances in diverse contexts. 
"Lexicon-based Unsupervised Sentiment Analysis" 
[13] focuses on adapting negation instances. By 
incorporating negation adjustments, this model 
refines its accuracy in discerning the sentiment 
associated with equipment health status, particularly 
in cases where negation impacts the overall 
sentiment context. It is a tailored sentiment analysis 
technique that caters to the specifics of industrial 
equipment monitoring, contributing to enhanced 
accuracy in health status assessments. "Spanish 
Financial Stability Report" [14] is proposed for 
applying sentiment analysis techniques to the 
financial domain, offering insights into sentiments 
prevalent in financial stability reports written in 
Spanish. Addressing the unique language of 
economic discourse provides a more accurate 
understanding of sentiment expressions, 
contributing to a nuanced comprehension of 
sentiments within financial stability reports in 
Spanish. 


"Aspect-Based Sentiment Analysis 
(ABSA)" [15] is the creation of a comprehensive and 
domain-specific dataset that facilitates the training 
and evaluation of ABSA models. It includes diverse 
user reviews annotated with aspect-level sentiments, 
offering a valuable resource for developing, refining, 
and benchmarking ABSA algorithms. "Pseudo 
Dense Counterfactual Augmentation" [16] designed 
to enhance ABSA. This approach addresses the 
challenge of data sparsity and introduces a 
mechanism for capturing diverse sentiments. It 
contributes to the improved generalization and 
robustness of ABSA models, ensuring more 
effective sentiment analysis in scenarios with limited 
labelled data. The "Bayesian Sentiment Analysis" 
[17] approach addresses the challenges of cold start 
and sparsity in ranking-based recommender systems. 
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It focuses on the recommendation process, 
enhancing adaptability to scenarios with limited 
data. It mitigates sparsity issues in ranking-based 
recommender systems. It represents an advancement 
in recommender system research, offering a tailored 
solution to limited user interactions. 


"Big Data Sentiment Analysis" [18] is 
proposed for Large-scale Text Processing (LTP) that 
aims for effective text classification. It enables a 
deep analysis of sentiments within the business 
environment. It focuses on public perception of 
business dynamics in Heilongjiang Province, 
showcasing the applicability of advanced text 
classification techniques in discerning sentiments at 
a large scale. "Prompt Semantic Augmented 
Network (PSAN)" [19] incorporates prompt 
semantic augmentation by enhancing its ability to 
understand and analyze sentiments associated with 
specific aspects. PSAN leverages to augment 
semantic understanding, contributing to a more 
nuanced and context-aware sentiment analysis. It 
provides the importance of semantic augmentation 
for improved accuracy in dissecting sentiments 
across various aspects. "Online Medical Reviews" 
[20] focuses on constructing an ABSA model aiming 
to meticulously design a model capable of discerning 
sentiments associated with specific elements within 
the complex domain of medical reviews. The 
intricacies of medical language and diverse aspects 
within studies offer a better understanding of 
sentiments, providing valuable insights for 
healthcare professionals and stakeholders. 


"Multi-Information Fusion and Interaction 
Neural Network" [21] is proposed to allow the model 
to fuse diverse sources of information effectively for 
a comprehensive sentiment analysis. This neural 
network model addresses the challenge of capturing 
sentiments across various aspects by dynamically 
interacting with multiple information channels, 
contributing to enhanced accuracy. The importance 
of synthesizing diverse information is deeply 
analyzed for better accuracy. "Text-Guided Multi- 
Task Learning Network" [22] focus on efficient 
analysis of sentiments across multiple modalities, 
providing a cohesive understanding of sentiments in 
diverse data sources. It narrows to multimodal 
sentiment analysis by introducing a network 
architecture that optimally utilizes textual guidance 
to improve sentiment understanding across multiple 
tasks. "Disentanglement Translation Network" [23] 
facilitates effective translation and sentiment 
analysis across different modalities. This model 
enhances its ability to capture the unique 
characteristics of each modality, contributing to a 


more refined understanding of sentiments in 
multimodal data. It represents a_ significant 
advancement in the quest for sophisticated 
multimodal sentiment analysis methodologies. 


"Cloze Task Network based Convolutional 
Hierarchical Attention Networks (CCHAN)" [24] 
elevates the model's capacity for adaptive sentiment 
understanding. Unlike static embeddings _ that 
maintain a fixed representation for each word, 
dynamic embeddings in CCHAN adjust in real-time 
based on the surrounding context within sentences. 
It is pivotal in scenarios where sentiment subtly 
evolves within the confines of the same document or 
review. By dynamically refining contextual 
embeddings, CCHAN excels at capturing the 
nuanced changes in sentiment expression, resulting 
in a model that is accurate and highly sensitive to the 
intricacies of sentiment shifts. This feature makes 
CCHAN an invaluable tool for applications 
demanding fine-grained sentiment analysis in 
dynamic linguistic environments, ensuring its 
efficacy in capturing the subtle nuances inherent in 
various forms of textual communication. Bio- 
inspired optimization is also applied in multiple 
domain to attain better results [25], [26], [35]-[44], 
[27]-{34]. 


"Hybrid Neural Network techniques using 
Binary Coordinate Ascent Algorithm (HNN-BCA)" 
[45] extends its prowess beyond static evaluations by 
incorporating a robust temporal analysis approach. 
Unlike traditional sentiment analysis models, HNN- 
BCA considers the dynamic nature of sentiment 
expression over time. The HNN architecture is 
augmented with temporal features, allowing the 
model to capture evolving sentiments in a sequence 
of text. The BCA Algorithm complements this by 
optimizing the model's secular learning, ensuring it 
adapts effectively to changing sentiment patterns. 
This temporal dimension adds a new layer of 
sophistication to sentiment analysis, enabling 
applications to track sentiment trends, event-specific 
reactions, and dynamic social media discourse. 


3. BAYESIAN OPTIMIZATION 
FOREST (BA-RF) 

Bayesian Optimization-inspired Random 
Forest (BO-RF) merges the strengths of two 
powerful techniques: Bayesian optimization and 
random forests. This hybrid approach combines the 
robustness of random forests in handling complex 
datasets with the efficiency and adaptability of 
Bayesian optimization in hyperparameter tuning. 
BO-RF optimizes the hyperparameters of the 
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random forest algorithm using a _ Bayesian Algorithm 1: Initialization of Random Forest 


optimization framework, allowing for intelligent 
exploration of the hyperparameter space to 
maximize model performance. BO-RF converges to 
an optimal solution more efficiently than traditional 
random search methods by iteratively selecting 
hyperparameters based on _ their expected 
performance. The Bayesian optimization component 
allows BO-RF to learn from previous iterations, 
adapt its search strategy, and allocate computational 
resources more effectively, leading to faster 
convergence and improved model accuracy. This 
approach is particularly beneficial for tasks where 
model performance is critical, such as predictive 
modeling, classification, and regression tasks in 
various domains. 
3.1. Initialization of Random _ Forest 
Hyperparameters 

The initialization of hyperparameters in the 
BO-RF algorithm is crucial to set the foundation for 
subsequent optimization. The Random Forest 
algorithm is characterized by hyperparameters 
denoted as 0 = {0,, 02,....,0,}, where k represents 
the number of hyperparameters. These 
hyperparameters include the number of trees 
(Nirees), Maximum depth of each tree (Dingx), and 
other relevant settings. The initial configuration is 


represented as Og, where Oy = Ci O25 + - On}. 
O= {6,, 2, ou Ons (1) 
Oy = 104, Poprainr Ong} (2) 


Define the objective function f(@) that 
encapsulates the performance metric to be 
optimized, such as classification accuracy or mean 
squared error in a regression task. 


f(O) = Performance Metric (3) 


Select a Gaussian process (GP) as the 
surrogate model to approximate the unknown 
objective function f(@). The goal function and its 
uncertainty may be approximated probabilistically 
using the surrogate model. 


GP(u(®),2()) (4) 


where (@) is the mean function and X(@) is the 
covariance function of the Gaussian process. 


Hyperparameters 
Input: 
e =O: Set of Random Forest 
hyperparameters. 


e kk: Number of hyperparameters. 


Output: 
e Qo : Initial configuration of Random 
Forest hyperparameters. 


Procedures: 
Step 1: Initialize Hyperparameters(@, k): 
a. Nerees: Dmax> and other 
hyperparameters are elements of 


b.  Nereesy»Dmaxy» and other initial 


hyperparameter values are 
elements of Oo. 
c 0,=0 (Copy initial 


hyperparameter values). 

Step 2: Define Objective Function(): 

a. f(@) represents the objective 
function. 

b. Performance metric is calculated 
based on 0. 

Select Surrogate Model(): 

a. Gaussian process GP is chosen as 
the surrogate model. 

b. u(O) and X(@) denote the mean 
and covariance functions of the 
Gaussian process, respectively. 


Step 3: 


3.2. Surrogate Modeling with Gaussian Process 

The second step in the BO-RF algorithm 
involves selecting a surrogate model to approximate 
the unknown objective function using a Gaussian 
process (GP). This process aims to capture the 
underlying performance landscape of the Random 
Forest hyperparameters. To begin, this research 
initializes the surrogate model by defining the 
Gaussian process GP with its mean function u(0) 
and covariance function ¥(@). 


GP(u(@), 2(0)) (5) 


where @ represents the Random Forest 
hyperparameters. This Gaussian process serves as a 
probabilistic model, providing insights into the 
behaviour of the objective function and accounting 
for uncertainties in the performance landscape. 


The surrogate model aims to map the 
Random Forest hyperparameters 9 to the objective 
function f(0). This mapping is established through 


oa eee ee renee renee eee eee 
2655 


Journal of Theoretical and Applied Information Technology 
31° March 2024. Vol.102. No 6 


© Little Lion Scientific 


ISSN: 1992-8645 


www. jatit.org 


SATIT 


E-ISSN: 1817-3195 


the mean function “(@), representing the expected 
value of the objective function. 


f(O) = u(®) (6) 


Eq.(6) is a predictive tool estimating the 
objective function's behaviour across the 
hyperparameter space. 


In parallel, the covariance function ¥(0) 
characterizes the relationships between different 
hyperparameter configurations. It captures the 
uncertainty associated with the objective function's 
predictions and shapes the Gaussian process. 


2) (7) 


Eq.(7) encodes how changes in one set of 
hyperparameters correlate with changes in another, 
providing a measure of similarity or dissimilarity. 


The initialization of the surrogate model at 
this stage is crucial. The initial set of Random Forest 
hyperparameters ©, from the Eq.(2) influences the 
starting configuration of the Gaussian process. It sets 
the tone for subsequent Bayesian Optimization 
iterations, impacting the efficiency of the search for 
optimal hyperparameters. 


Algorithm 2: Surrogate Modeling with Gaussian 
Process 
Input: 
e @: Random Forest hyperparameters. 
e f (0): Objective function. 
e kk: Number of hyperparameters. 


Output: 
e §GP(u(@),X(@)): Gaussian process 
surrogate model. 
Procedure: 
Step 1: InitializeSurrogateModel 
(0, f(@),k): 
a. Create an empty Gaussian 
process GP. 


b. Define the mean function (0) 
and covariance function 2(@) 
for the Gaussian process. 

MapObjectiveFunction(): 

a. Utilize the surrogate model to 
map the Random Forest 
hyperparameters O to the 
objective function f (0). 


Step 2: 


b. Establish f(@) =u) 
through the mean function. 


Step 3: CaptureCovarianceStructure(): 

a. Capture the covariance 
structure 2(0) within the 
Gaussian process. 

b. The covariance _ structure 
characterizes relationships 
between different 
hyperparameter configurations. 

Step 4: ImpactofInitialization(): 


a. Acknowledge the impact of the 
initial set of hyperparameters 
© on the surrogate model. 

b. The initial configuration 
influences subsequent 
iterations in the Bayesian 


Optimization process. 


Algorithm 2 outlines the steps involved in 
surrogate modelling with a Gaussian process within 
the context of the BA-RF algorithm. It emphasizes 
the initialization of the surrogate model, mapping the 
objective function, and capturing covariance 
structures to provide a probabilistic estimate of the 
objective function's behaviour. 


3.3. Capturing Covariance Structure in Gaussian 
Process 

In the BA-RF algorithm, the third step 
involves capturing the covariance structure within 
the Gaussian process surrogate model. This crucial 
step enhances our understanding of how changes in 
one set of Random Forest hyperparameters correlate 
with changes in another, shaping the probabilistic 
model's predictive capability. 


The covariance structure, denoted as ¥(0), 
is a fundamental component of the Gaussian process 
(GP) surrogate model. It characterizes the 
relationships between different configurations of 
Random Forest hyperparameters. Specifically, the 
covariance matrix 2(@) quantifies the degree of 
similarity or dissimilarity between pairs of 
hyperparameter sets. 


2) (8) 


This matrix provides insights into how 
variations in one hyperparameter impact the 
objective function in conjunction with variations in 
other hyperparameters. The surrogate model utilizes 
this information to predict the objective function's 
behaviour across the hyperparameter space. 
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The elements of the covariance matrix 
%(@) are determined by the choice of the covariance 
function. Each element }';;(@) represents the 
covariance between the i-th and j-th 
hyperparameters. The diagonal elements ));;(@) 
indicate the variance of the i-th hyperparameter. 


>, (9) 


This matrix encapsulates the 
interdependence among hyperparameters, 
contributing to the model's understanding of the 
objective function landscape. 


The covariance structure also encodes 
information about the relationships between 
hyperparameters. Positive covariance (3; > 0) 
signifies a positive correlation, suggesting that 
increases in one hyperparameter will likely coincide 
with increases in another. Conversely, negative 


covariance (2; jX< 0) indicates a negative 
correlation. 
Covariance Relationship: %,; is 
> 0 (positive correlation) 
. : . : (10) 
Covariance Relationship: %,; is 
< 0 (negative correlation) 


Understanding these relationships is crucial 
in guiding the search for optimal hyperparameters 
during the subsequent Bayesian Optimization 
iterations. 


Algorithm 3: Capturing Covariance Structure in 
Gaussian Process 


Input: 
e @: Random Forest hyperparameters. 
e kk: Number of hyperparameters. 


Output: 
e X(@): Covariance matrix capturing 
relationships between hyperparameters. 


Procedure: 
Step 1: | CaptureCovarianceStructure(@, k): 

a. Initialize an empty covariance 
matrix ¥(@) of size k x k. 

b. Define the covariance function 
to determine _ relationships 
between hyperparameters. 

Step 2: | DetermineCovarianceElements(): 


a. Iterate through each pair of 


hyperparameters, calculating 


dij(@) for the covariance 
matrix. 

b. Diagonal elements ());(0)) 
represent the variance of each 
hyperparameter. 

PositiveCovarianceRelationship(): 

a. Identify positive covariance 
relationships (2; j> 
0) between pairs of 
hyperparameters. 

b. Positive covariance signifies a 
positive correlation, suggesting 
concurrent increases in 
hyperparameter values. 

NegativeCovarianceRelationship(): 

a. Identify negative covariance 
relationships (2; i< 0) 
between pairs of 
hyperparameters. 

b. Negative covariance signifies a 
negative correlation, indicating 
concurrent decreases or 
increases in hyperparameter 
values. 

OutputCovarianceMatrix(): 

a. Return the covariance matrix 
%(@) capturing relationships 
between Random Forest 
hyperparameters. 


Step 3: 


Step 4: 


Step 5: 


Algorithm 3 captures the covariance 
structure within the Gaussian process surrogate 
model. The procedure calculates covariance 
elements, identifies relationships, and produces a 
covariance matrix that informs the probabilistic 
modelling of the objective function landscape. 


3.4. Bayesian Optimization Iterations 


This step builds on the surrogate model 
established in earlier steps, leveraging the 
probabilistic insights gained from the Gaussian 
process to make informed decisions about where to 
evaluate the objective function next. 


The acquisition function, denoted as A(0), 
plays a pivotal role in the iterative optimization 
process. It acts as a heuristic to balance exploration 
and exploitation. Several standard acquisition 
functions exist, such as Expected Improvement (EI), 
Probability of Improvement (PI), and Upper 
Confidence Bound (UCB). The selection of the 
acquisition function is a critical aspect of Bayesian 
Optimization, influencing the algorithm's 
exploration strategy. 
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Step 1: InitializeBayesianOptimization 
A(@) (12) 


The acquisition function guides the 
selection of the next set of Random Forest 
hyperparameters to evaluate the objective function. 
By maximizing the acquisition function, the 
algorithm identifies promising regions in the 
hyperparameter space. The chosen hyperparameter 
configuration is denoted as Onoxt- 


Onext = argmax A(O) (13) 


The algorithm evaluates the objective 
function at the selected hyperparameter 
configuration One,; . This involves computing the 
performance metric for the Random Forest model 
with the chosen hyperparameters. 


f (next) (1 4) 
= Objective Function Evaluation 

The surrogate model is updated with the 
new observation following the objective function 
evaluation. The updated Gaussian process now 
incorporates information about the objective 
function's behaviour at the chosen hyperparameter 
configuration. 


GP(u(),2@)) (15) 


The surrogate model is updated iteratively 
as the objective function is evaluated and 
hyperparameters are selected until a stopping 
criteria, such as convergence or a preset number of 
iterations, is reached. 


Algorithm 4. Bayesian Optimization Iterations 


Input: 
e A(@): Acquisition function. 


e @: Random Forest hyperparameters. 
e f (0): Objective function. 
e kk: Number of hyperparameters. 
e Stopping criterion. 
Output: 
© § Onexe : Hyperparameter configuration. 


e =f (Onext.): Objective function evaluation. 
e Updated surrogate model 
GP(u(@), 2(0)). 


Procedure: 


(A(0), 0, f(@), k, Stopping Criteria 


a. Set initial hyperparameters 0 
based on the surrogate model 
initialization. 

b. Initialize surrogate model GP 
with 0) and f(@o). 

c. Set iteration counter t = 0. 

IterativeOptimization(): 

a. While the stopping criterion is 
not met: 

e §=Calculate acquisition 
function A(@,) using the 
surrogate model. 


Step 2: 


e = Identify the next 
hyperparameter 
configuration Onext = 
arg max A(0,;). 

e Evaluate the objective 


function at Onext: f (Onext): 
e Update the surrogate model 
GP with the new 
observation 
(Orn f (Cree) ): 
e Increment iteration counter 
t=tt+1. 


Algorithm 4 outlines the iterative 
optimization process in Bayesian Optimization, 
emphasizing the role of the acquisition function in 
guiding the exploration-exploitation trade-off. The 
procedure iteratively refines the hyperparameter 
configuration based on the surrogate model's 
probabilistic insights until the stopping criterion is 
met. 


3.5. Updating the Surrogate Model 


In BO-RF algorithm, the fifth step involves 
updating the surrogate model after each iteration. 
This crucial step ensures that the probabilistic model 
accurately represents the observed behaviour of the 
objective function at the chosen hyperparameter 
configurations. 


Upon the evaluation of the objective 
function at the selected hyperparameter 
configuration 0,.¢,, the surrogate model is updated 
to incorporate this new observation. The Gaussian 
process (GP) is refined to reflect better the true 
performance landscape of the Random Forest model. 


GP(u(@), =(0)) (16) 
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The updated surrogate model is based on an 
augmented dataset that includes the previous 
observations (0;, f (0,)) and the most recent 
evaluation (Oisuti f (Onder): 

{(0;, f (;)) |é = 1,2, ...t 
(17) 
~ 1, (Greer f @next))} 

The mean function “(@) of the Gaussian 
process is recalculated based on the augmented 
dataset. It represents the expected value of the 
objective function at any given hyperparameter 
configuration. 


u(@) (18) 


Simultaneously, the covariance function 
%(@) is adjusted for the new observation. The 
covariance structure characterizes the relationships 
between different hyperparameter configurations 
and plays a crucial role in guiding the exploration of 
the hyperparameter space. 


2@) (19) 


Algorithm 5 captures the continuous 
refinement of the Gaussian process, ensuring it 
accurately represents the observed behaviour of the 
objective function at different hyperparameter 
configurations. 


Algorithm 5. Updating the Surrogate Model 


Input: 
e §6GP(u(@),X(@)): Surrogate model with 
mean and covariance functions. 
© §$Onext: Newly evaluated hyperparameter 
configuration. 
© =f (next): Objective function value at 


Onext : 
e Dataset: [Existing dataset with 
hyperparameter configurations and 


corresponding objective function values. 


Output: 

e Updated surrogate model 
GP(u(@),X(@)) reflecting the new 
observation. 

Procedure: 
Step 1: UpdateSurrogateModel 


(GP(u(@), 2()), Onextf @next)) 


a. Augment the dataset with the 
new observation: Dataset <— 


Dataset U {(Onext, f (Onext))} 


b. Recalculate the mean function 
H(@) based on the augmented 
dataset. 

c. Adjust the covariance structure 
%(@) to account for the new 
observation. 

d. Return the updated surrogate 
model GP(u(@), X(@)) 
reflecting the refined 
understanding of the objective 
function landscape. 


3.6. Out-Of-Bag (OOB) Error Estimation 


OOB leverages the inherent nature of the 
Random Forest algorithm, which constructs multiple 
decision trees based on bootstrapped subsets, to 
estimate the model's performance without requiring 
a separate validation set. 


Before delving into OOB error estimation, 
it is essential to understand the training process of 
the Random Forest. The ensemble of decision trees, 
denoted as T,,T2,...,Ty , 1S constructed based on 
bootstrapped subsets of the original dataset. Each 
tree is trained with a subset of the data, introducing 
diversity into the ensemble. 


T,,T>, ...,Ty are decision trees (20) 


During the construction of each decision 
tree T;, a subset of samples is left out, forming the 
OOB samples. These OOB samples are denoted as 
Doog and consist of data points not included in the 
bootstrap sample for the i-th tree. 

Doop; = Out — of (21) 
— bag samples f or T; 

The OOB samples are then used to obtain 
predictions from each decision tree T;. Let Yprea; 
represent the predicted values on the OOB samples 
by the i-th tree. 


Yprea; = T;(OOB samples ) (22) 


The predictions from individual trees are 
aggregated to obtain the final ensemble prediction on 
the OOB samples. For classification tasks, a majority 
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vote is often employed, while for regression tasks, 
the predictions are averaged. 


N 
1 
Final OOB Prediction = =). Ypred; (23) 
i=1 


The utilization of OOB samples for error 
estimation aligns with the overarching goal of 
understanding the model's performance. The 
updated surrogate model from Section 3.5 enhances 
the accuracy of predictions, contributing to a more 
reliable OOB error estimate. This step ensures a 
robust evaluation of the Random Forest model's 
performance, facilitating informed decisions in the 
subsequent steps of the BO-RF algorithm. This 
mathematical description elucidates the integral role 
of OOB error estimation in the BA-RF algorithm, 
bridging the gap between the training process and 
the overall evaluation of the model's effectiveness. 


Algorithm 6. Out-of-bag (OOB) Error 
Estimation 


Input: 
e Random Forest ensemble T,, 72, ...,Ty. 


e Original dataset. 

e Training process details. 

e Decision tree construction details. 
Output: 


e OOB predictions. 
e Aggregated OOB prediction. 


Procedure: 
Step 1: RandomForestTraining(): 
a. _ For each decision tree T; in the 
ensemble: 

e Construct a _ bootstrap 
sample from the original 
dataset. 

e =6 Train T; with the bootstrap 
sample. 

e Identify the OOB samples 
Doog, left out during the 
construction of T;. 

e Predict values Ypreq, on 
the OOB samples using T;. 

Step 2: AggregateOOBPredictions(): 


a. Aggregate individual OOB 
predictions Yyreq, from all 
decision trees. 

b. Use a _ majority vote for 
classification tasks to obtain the 
final OOB prediction. 


c. For regression tasks, average 
the OOB predictions to obtain 
the final ensemble prediction. 


Algorithm 6 outlines the steps involved in 
the OOB error estimation process within the context 
of the BA-RF algorithm. It details the training of the 
Random Forest ensemble and the subsequent 
aggregation of OOB predictions for a robust 
evaluation of model performance. 


3.7. Bayesian 
Criterion 


Optimization Convergence 


In the BO-RF algorithm, the seventh step 
involves establishing a convergence criterion to 
determine when to conclude the optimization 
process. This criterion is essential for preventing 
unnecessary iterations, ensuring computational 
efficiency, and providing a reliable solution for the 
identified hyperparameters. 


The convergence criterion often relies on 
monitoring the behaviour of the objective function 
value over successive iterations. Let f, denote the 
objective function value at iteration t, and f,_; 
represent the objective function value at the previous 
iteration. The optimization process is considered to 
have converged when the change in the objective 
function value is below a predefined threshold e. 


lft —frail Se (24) 


Convergence can be defined based on the 
stability of the identified hyperparameter 
configuration. Let 0, represent the hyperparameter 
configuration at iteration t, and 0,_, denote the 
hyperparameter configuration at the previous 
iteration. Convergence is achieved when the 
hyperparameter configurations remain stable over 
iterations. 


(25) 


To ensure computational efficiency, a 
predefined maximum number of iterations (Tingx) 
can be established. The optimization process 
concludes when the number of iterations reaches this 
limit. 


(26) 


Algorithm 7 involves assessing 
convergence in the BA-RF algorithm. It provides 
mechanisms to determine convergence based on 
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changes in the objective function values, stability of 
hyperparameter configurations, and adherence to a 
predefined iteration limit. The convergence decision 
is a critical aspect of the overall optimization 
process. 


Algorithm 7: Bayesian Optimization 

Convergence Criterion 

Input: 
e Objective function values f;. 
e Hyperparameter configurations 0,. 
e Maximum number of iterations Tnx 
e Convergence threshold e. 
Output: 
e Convergence decision. 
Procedure: 
Step 1: | ObjectiveFunctionConvergence 

(ft, €): 

a. Initialize t=1 as the first 
iteration. 

b. While t > 1: 

e =Check if |f, — ft_i] < €. 
e 6If true, consider 
convergence achieved. 
e = If false, continue to the next 
iteration. 
Step 2: HyperparameterConfigurationConve 
rgence(0,): 

a. Initialize t=1 as the first 
iteration. 

b. While t > 1: 

e Check if 0, = 0,3. 
e If true, consider 
convergence achieved. 
e = If false, continue to the next 
iteration. 
Step 3:  IterationLimit( t, Tinax): 

a. Check if t = Tmax. 

b. If true, consider convergence 
achieved. 

c. If false, continue to the next 
iteration. 

Step 4: ConvergenceDecision(): 

a. Return the decision based on 
the convergence criteria. 

b. If any of the convergence 
criteria are met, consider 
convergence achieved. 

c. If none of the criteria are met, 
continue the optimization 
process. 


3.8. Optimal Hyperparameter Selection 

This step marks the culmination of the 
algorithm, providing a_ refined set of 
hyperparameters that yield optimal performance for 
the given task. The primary goal is to optimize the 
objective function f(@) for the Random Forest 
hyperparameters @. The identified optimal 
hyperparameters, denoted as @*, maximize or 
minimize the objective function, depending on the 
nature of the optimization task. 


The surrogate model, represented by the 
Gaussian process (GP), has been continuously 
refined throughout the Bayesian Optimization 
iterations. The optimal hyperparameters are often 
selected based on the mean function u(@) of the 
Gaussian process, representing the expected value of 
the objective function. 


0* = arg maxpu(O) (27) 


While the mean function guides the 
selection of the optimal hyperparameters, it is 
essential to consider the uncertainty associated with 
the predictions. The covariance function 2(0) 
measures uncertainty, allowing for a more informed 
decision. 

Incorporate uncertainty : 0* 
=arg max(“(@) 
+k. a(0)) 
where, o(@) represents the standard deviation 
associated with the Gaussian process predictions, 
and x is a user-defined parameter that balances 
exploration and exploitation. 


(28) 


Optimal Hyperparameter Selection 
elucidates the mathematical framework for selecting 
optimal hyperparameters in the BO-RF algorithm. 
Incorporating uncertainty provides a nuanced 
approach to decision-making, ensuring a balance 
between exploiting known promising regions and 
exploring potentially superior areas in the 
hyperparameter space. 


Algorithm 8: Optimal Hyperparameter Selection 


Input: 
e Surrogate model GP(u(O),2(O)) with 
mean and covariance functions. 
e Exploration-exploitation parameter x. 


Output: 
e = Optimal hyperparameters 
@* maximizing the objective function. 
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Procedure: 
Step 1: | SelectOptimalHyperparameters 

(GP(u(@), X(@)), K): 

a. Calculate the mean function 
H(@) based on the surrogate 
model. 

b. Calculate the standard 
deviation function o(@) based 
on the surrogate model. 

c. Incorporate uncertainty into the 


selection —_ process: O* = 
arg max(u(0) +K- a(0)). 
d. d. Return the — optimal 


hyperparameters 0". 


Algorithm 8 outlines the process for 
selecting optimal hyperparameters in the BA-RF 
algorithm. It utilizes the refined surrogate model to 
balance exploration and exploitation, considering 
both the mean function and the uncertainty captured 
by the standard deviation. The result is a set of 
hyperparameters that maximize the objective 
function within the defined constraints. 


3.9. Model Retraining 
Hyperparameters 

The BO-RF algorithm involves retraining 
the Random Forest model using the optimal 
hyperparameters identified in Section 3.8. This step 
ensures the model achieves its maximum potential 
performance by utilizing the refined set of 
hyperparameters gleaned through the Bayesian 
Optimization process. 


with Optimal 


The optimal hyperparameters, denoted as 
0* , are the result of the Bayesian Optimization 
convergence and selection process in Step 8. These 
hyperparameters represent the configuration that 
maximizes or minimizes the objective function, 
embodying the refined understanding of the Random 
Forest model's behaviour. 


0* = f(0*) = max or min f (@) (29) 

With the optimal hyperparameters in hand, 
the Random Forest model is retrained using the 
entire original dataset. This step contrasts with the 
iterative optimization process, where the model was 
trained on bootstrapped subsets. Retraining the full 
dataset ensures that the model benefits from all the 
available information and generalizes well to new, 
unseen data. 


(30 
) 


Retrain Random Forest Model with 0* 


The complete training set is denoted as 
Derain » 18 employed for retraining. This set 
encompasses all data points from the original 


dataset, ensuring a comprehensive learning 
experience for the Random Forest model. 
Derain = Full training set (31) 


This mathematical description highlights 
the pivotal role of model retraining in the BA-RF 
algorithm, emphasizing the utilization of the optimal 
hyperparameters to maximize the model's predictive 
power. 


3.10. Model 
Assessment 

This step is crucial for assessing the model's 
effectiveness in predicting new, unseen data and 
providing insights into its generalization 
capabilities. Define an evaluation metric, denoted as 
E, that quantifies the model's performance. This 
metric is chosen based on the nature of the task, 
whether it be a classification or regression problem. 
Standard evaluation metrics include accuracy, 
precision, recall, F1 score for classification, or mean 
squared error for regression. 


Evaluation and Performance 


E = Evalution Metric (32) 


Utilizing a separate test dataset, denoted as 
Deest,; Which was not used during the training or 
optimization processes. This dataset accurately 
indicates the model's ability to generalize to new, 
unseen data. 


Drest = Test dataset (33) 


Generate predictions from the retrained 
Random Forest model on the test dataset. Let Ypreq 
represent the vector of predicted values. 


YVored (34 
= Retrained Model Predictions on Dyest _) 


Evaluate the model's predictions using the 
chosen evaluation metric E'. This step quantifies the 
model's performance on the test dataset and provides 
insights into its real-world applicability. 


Equation 36: Model Perf ormance 


= E(True values from Drest , Yprea (39) 


Algorithm 9 shows how to evaluate the 
performance of the retrained Random Forest model 
on a separate test dataset. The chosen evaluation 
metric quantitatively measures how well the model 
generalizes to new, unseen data, providing valuable 
insights into its real-world applicability. 
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Algorithm 9: Model Evaluation and Performance 
Assessment 


Input: 
e ~=Retrained Random Forest model. 
e = Test dataset Drest . 
e Evaluation metric E. 


e Model performance based on the chosen 
evaluation metric. 


Procedure: 
Step 1: EvaluateModelPerformance 
(Retrained Model, Drest, E): 

a. Generate predictions Yprea 
from the retrained Random 
Forest model on the test dataset 
D test* 

b. Calculate the model 
performance using the chosen 
evaluation metric E by 
comparing the predicted values 
Yprea With the true values from 
the test dataset. 

c. Return the model performance 


based on the evaluation metric. 


Algorithm 10 outlines the step-by-step 
process of the BO-RF algorithm, encompassing data 
preparation, model _ initialization, iterative 
optimization, hyperparameter selection, model 
retraining, and final performance evaluation. 


Algorithm 10. Bayesian Optimization Random 
Forest (BO-RF) 


Input: 

Dataset D. 

Random Forest model architecture. 
Acquisition function parameters. 
Bayesian Optimization parameters. 
Evaluation metric EF’. 

Maximum number of iterations Ting - 
Convergence threshold e€. 


e Optimal hyperparameters. 
e Retrained Random Forest model. 
e Model performance based on the chosen 
evaluation metric. 
Procedure: 
Step 1: —InitializeRandomForest 


(D, Random Forest model arch 
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a. Split the dataset D into training 
and validation sets. 

b. Initialize the Random Forest 
model with default 
hyperparameters. 

c. Train the initial Random Forest 
model on the training set. 

Step 2: SurrogateModellInitialization(): 

a. Initialize the surrogate model 
GP(u(@),2(@)) with the 
initial Random Forest model's 
performance on the validation 
set. 

Step 3: — IterativeOptimization(T max , €): 

a. For each iteration t up to Tmax: 

e Use the surrogate model to 
select the next 
hyperparameter 
configuration. 

e Train a Random Forest 
model with the selected 
hyperparameters on the 
training set. 

e Evaluate the model 
performance on the 
validation set. 

e Update the — surrogate 
model with the new 
observation. 

e =©Check convergence 
criteria based on_ the 
chosen threshold e. 

e 6If convergence is 
achieved, exit the loop. 

Step 4: | OptimalHyperparameterSelection() 


a. Select the optimal 
hyperparameters based on the 
refined surrogate model. 

Step 5: RetrainModelWithOptimalHyperpa 
rameters(Optimal 
Hyperparameters): 

a. Retrain the Random Forest 
model using the optimal 
hyperparameters on the full 
training set. 

Step 6: | EvaluateModelPerformance 
(Retrained Model, Dyo51, E): 

a. Generate predictions from the 
retrained Random Forest model 
on a separate test dataset Drest. 

b. Calculate the model 
performance using the chosen 
evaluation metric E by 
comparing the predicted values 
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with the true values from the 
test dataset. 


Step 7: | OutputResults(): 
a. Return the optimal 
hyperparameters. 


b. Return the retrained Random 
Forest model. 

c. Return the model performance 
based on the evaluation metric. 


4. CONSUMER REVIEWS DATASET 

The "Consumer Reviews Dataset," 
encompassing 34,000 reviews for Amazon products, 
becomes a focal point for this paper, delving into the 
intricate relationship between keywords in review 
texts and corresponding sentiment analysis. 
Researchers can map specific keywords to review 
ratings, uncovering nuanced insights into how 
particular terms influence overall product 
perceptions. The study may also explore the 
evolution of keyword sentiment over time and its 
impact on consumer sentiments concerning the 
review count. By refining sentiment models through 
this keyword-centric approach and considering the 
substantial review count, the paper aims to enhance 
the precision of sentiment analysis tools, providing a 
valuable contribution to natural language processing 
and sentiment modelling in the context of e- 
commerce and consumer reviews on Amazon 
products. 


Table 1. Features of Consumer Reviews Dataset 


Feature Description 
id Unique identifier for each 
entry 
Date when the entry was 
dateAdded added 
to the dataset 
dateUpdated Date when the entry was last 
updated 
name Name of the product 
Amazon Standard Identifica- 
asins —tion Numbers 
associated with the product 
brand Brand of the product 
, Categories to which the produc 
categories 
belongs 
primary Primary category to which the 
Categories product is assigned 
, URLs of images associated wit} 
pagel nts the product 
k Keywords or key phrases 
oie associated with the product 


E-ISSN: 1817-3195 
manufacturer | Manufacturer of the product 
manufacturer Manufacturer's identification 

Number number for the product 


reviews. date Date of the consumer reviews 


Dates when the reviews were 
observed or recorded 
Indicator of whether the 
reviewer claims to have 
purchased the product 
Indicator of whether the 
reviews.doRecom reviewer recommends 

the product 

Unique identifier for each 


reviews 
.dateSeen 


reviews. didPurch 


reviews. id é 
review 
reviews. Number of users who found 
numHelp ful the review helpful 
ne Rating given by the reviewer 
reviews. URLs of the sources from 
sourceURLs which reviews were obtained 


reviews. text Text content of the reviews 


: . Title or summary of the 
reviews. title yor 


reviews 
reviews. : 
Username of the reviewer 
username 
URLs of the sources providin 
sourceURLs f c g 


about the product 


5. RESULTS AND DISCUSSIONS 
5.1. Positive Rate Analysis 

Figure 1 presents a comprehensive analysis 
of the True Positive Rate (TPR) and False Positive 
Rate (FPR) for three classification algorithms: 
CCHAN, HNN-BCA, and the proposed method, 
BO-RF. These metrics are crucial in evaluating the 
performance of classification models, especially in 
scenarios where the cost of false positives and false 
negatives varies. 


5.1.1. True Positive Rate (TPR): 

TPR, also known as sensitivity, measures 
the proportion of correctly identified positive 
instances out of all positive ones. A higher TPR 
indicates that the classifier effectively identifies 
positive cases, minimizing the likelihood of false 
negatives. For the CCHAN algorithm, the TPR 
stands at 63.228%. This suggests that CCHAN 
correctly detects approximately 63.228% of positive 
instances in the dataset. However, there is room for 
improvement in capturing more positive cases. The 
HNN-BCA algorithm exhibits a slightly higher TPR 
of 67.318%. This implies that HNN-BCA performs 
marginally better than CCHAN in identifying 
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positive instances, accurately capturing around 
67.318% of them. In contrast, the proposed BO-RF 
algorithm achieves a significantly higher TPR of 
89.440%. This indicates a substantial improvement 
over the other two algorithms, with BO-RF 
successfully identifying approximately 89.440% of 
positive instances, showcasing its efficacy in 
positive instance detection. 


OCCHAN OHNN-BCA OBO-RF 


True Positive Rate False Positive Rate 
Performance Metrics 


Figure 1. Positive Rate 


Table 1. Positive Rate Result Values 


Classification| True Positive| False Positive 
Algorithms Rate (%) Rate (%) 
CCHAN 63.228 43.791 
HNN — BCA 67.318 35.412 
BO — RF 
(Proposed) 89.440 11.396 


5.1.2. False Positive Rate (FPR) 

The FPR quantifies the proportion of 
incorrectly identified positive instances out of all 
negative ones. A lower FPR indicates a reduced rate 
of false alarms or false positives, which is desirable 
in many classification tasks. For the CCHAN 
algorithm, the FPR is relatively high at 43.791%. 
This suggests that CCHAN tends to classify negative 
instances as positive, leading to many false alarms. 
The HNN-BCA algorithm demonstrates a lower 
FPR of 35.412% compared to CCHAN. This 
indicates an improvement in minimizing false 
alarms, with HNN-BCA making fewer erroneous 
positive predictions among actual negative 
instances. The proposed BO-RF algorithm achieves 
a substantially lower FPR of 11.396%. This signifies 


a significant reduction in false alarms compared to 
CCHAN and HNN-BCA, showcasing BO-RF's 
capability to accurately classify negative instances 
without erroneously labelling them as positive. 


5.1.3. Interpretation and Implications of Psotive 
Rate Analysis 

From the analysis of Figure 1, several 
insights can be gleaned regarding the performance of 
the classification algorithms. While CCHAN and 
HNN-BCA demonstrate moderate performance in 
TPR and FPR, they both exhibit limitations in 
accurately capturing positive instances and 
minimizing false alarms. However, the proposed 
BO-RF algorithm outperforms CCHAN and HNN- 
BCA, achieving higher TPR and significantly lower 
FPR. This suggests that BO-RF offers a more robust 
and reliable solution for positive instance detection 
while effectively reducing false alarms, making it a 
promising approach for classification tasks where 
precision and recall are critical metrics. Figure 1 
underscores the importance of evaluating both TPR 
and FPR in assessing the effectiveness of 
classification algorithms. By considering these 
metrics, stakeholders can make informed decisions 
about algorithm selection and optimization strategies 
to enhance model performance and reliability. 


5.2. Negative Rate Analysis 

Figure 2 presents a detailed analysis of the 
True Negative Rate (TNR) and False Negative Rate 
(FNR) for three classification algorithms: CCHAN, 
HNN-BCA, and BO-RF. These metrics are essential 
in understanding the ability of classifiers to identify 
negative instances and avoid false negative 
predictions accurately. 


5.2.1. True Negative Rate (TNR) 

TNR, also known as specificity, measures 
the proportion of correctly identified negative 
instances out of all actual negative instances. A 
higher TNR indicates that the classifier effectively 
identifies negative cases, minimizing the likelihood 
of false positive predictions. For the CCHAN 
algorithm, the TNR is 56.209%. This implies that 
CCHAN correctly identifies approximately 
56.209% of negative instances in the dataset. 
However, there is room for improvement in 
capturing more negative cases accurately. The HNN- 
BCA algorithm exhibits a higher TNR of 64.588% 
compared to CCHAN. This suggests that HNN-BCA 
performs better in identifying negative instances, 
accurately capturing around 64.588% of them. In 
contrast, BO-RF achieves a significantly higher 
TNR of 88.604%. This indicates a substantial 
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improvement over both CCHAN and HNN-BCA, 
with BO-RF successfully identifying approximately 
88.604% of negative instances, showcasing its 
efficacy in negative instance detection. 


OCCHAN OHNN-BCA OBO-RF 


True Negative Rate False Negative Rate 
Performance Metrics 
Figure 2. Negative Rate Analysis 


Table 2. Positive Rate Result Values 


Classification | True Negative Balse 
Algorithms Rate (%) Megarive 
Rate (%) 
CCHAN 56.209 36.772 
HNN — BCA 64.588 32.682 
BO — RF 
(Proposed) 88.604 10.560 


5.2.2. False Negative Rate (FNR) 

FNR_ quantifies the proportion of 
incorrectly identified negative instances out of all 
positive ones. A lower FNR indicates a reduced rate 
of missing positive instances, which is crucial for 
applications where identifying positive instances is 
paramount. For the CCHAN algorithm, the FNR is 
36.772%. This suggests that CCHAN fails to predict 
approximately 36.772% of positive instances, 
indicating room for improvement in capturing more 
positive cases. The HNN-BCA algorithm 
demonstrates a slightly lower FNR of 32.682% 
compared to CCHAN. This implies that HNN-BCA 
performs marginally better in avoiding false 
negative predictions, missing approximately 
32.682% of positive instances. BO-RF achieves a 
substantially lower FNR of 10.560%, indicating a 
significant improvement over both CCHAN and 
HNN-BCA. BO-RF effectively minimizes false 


negative predictions, missing only approximately 
10.560% of positive instances. 


5.2.3. Interpretation and Implications of Negative 
Rate Analysis 

The interpretation of the analysis presented 
in Figure 2 holds significant implications for 
selecting and optimizing classification algorithms in 
real-world applications. The observed differences in 
TNR and FNR among the evaluated algorithms 
underscore the varying performance levels in 
accurately identifying negative instances and 
avoiding false negative predictions. Specifically, the 
BO-RF algorithm is the most effective in negative 
instance detection, achieving a substantially higher 
TNR and lower FNR than CCHAN and HNN-BCA. 
This superiority of BO-RF suggests its potential as a 
preferred choice for tasks where minimizing false 
negative predictions is crucial, such as medical 
diagnosis or anomaly detection. These findings 
provide actionable insights for practitioners, 
enabling them to make informed decisions regarding 
algorithm selection and optimization strategies to 
enhance model performance and reliability in 
scenarios where accurately identifying negative 
instances 1s paramount. 


OCCHAN OHNN-BCA OBO-RF 


Classification F-Measure 
Accuracy 


Performance Metrics 


Figure 3. Classification Accuracy and F-Measure 
Analysis 


5.3. Classification Accuracy and F-Measure 
Analysis 

Figure 3 offers a technical evaluation of 
Classification Accuracy (CA) and F-Measure (FM) 
for three distinct classification algorithms: CCHAN, 
HNN-BCA, and the proposed BO-RF. These metrics 
provide critical insights into the algorithms' 
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predictive performance and ability to balance 
precision and recall. 


Table 3. Classification Accuracy and F-Measure Values 


Classification | Classification | F —Measu 
Algorithms Accuracy (%) re (%) 
CCHAN 59.554 59.840 
HNN — BCA 65.934 66.084 
BO — RF 
(Grenceed) 89.026 89.165 


5.3.1. Classification Accuracy (CA) 

CA, a fundamental metric, quantifies the 
percentage of correctly predicted instances from the 
total dataset. The algorithms achieve varied levels of 
CA, reflecting their respective predictive accuracies. 
CCHAN employs a_ hierarchical attention 
mechanism and convolutional operations to capture 
intricate patterns in text data, contributing to its 
ability to classify around 59.554% of instances 
correctly. HNN-BCA leverages a hybrid neural 
network architecture along with the Binary 
Coordinate Ascent Algorithm for optimization, 
resulting in superior predictive accuracy compared 
to CCHAN. BO-RF integrates Bayesian 
Optimization with Random Forest, enabling 
efficient hyperparameter tuning and feature 
selection. This contributes significantly to its highest 
CA of 89.026%, showcasing its superior predictive 
accuracy. 


5.3.2. F-Measure (FM) 

FM, balancing precision and _ recall, 
comprehensively evaluates classifier performance. 
The algorithms achieve varying FM_ scores, 
reflecting their ability to balance precision and 
recall. CCHAN's utilization of convolutional and 
attention mechanisms leads to a reasonable balance 
between precision and recall, resulting in an FM of 
59.840%. HNN-BCA's hybrid neural network 
architecture and optimization strategy contribute to 
a slightly higher FM of 66.084% compared to 
CCHAN, indicating a better balance between 


precision and recall. BO-RF's integration of 
Bayesian Optimization facilitates robust 
hyperparameter tuning, resulting in superior 


precision and recall balance, reflected in its highest 
FM of 89.165% among the algorithms evaluated. 


5.3.3. Interpretation and Implications: 

CCHAN offer a moderate level of 
predictive accuracy and precision-recall balance. 
While these networks excel in capturing nuanced 
patterns in textual data, their performance suggests 
enhanced opportunities in accurately classifying 
sentiment polarity in online shopping reviews. 
HNN-BCA _ exhibits improved performance 
compared to CCHAN. By leveraging a hybrid neural 
network architecture and optimization strategies, 
HNN-BCA_ demonstrates superior predictive 
accuracy and precision-recall balance, making it a 
promising choice for sentiment analysis in online 
shopping. BO-RF emerges as the top performer 
regarding predictive accuracy and precision-recall 
balance. BO-RF optimizes hyperparameters and 
feature selection by integrating Bayesian 
Optimization with Random Forest, leading to robust 
sentiment analysis results. In online shopping, BO- 
RF's superior performance indicates its potential to 
accurately classify customer sentiments expressed in 
online reviews, enabling businesses to gain valuable 
insights into customer satisfaction and preferences. 


High-performing classification algorithms, 
such as BO-RF, enable businesses to gain deeper 
insights into customer sentiments regarding their 
products and services. By accurately classifying 
sentiments expressed in online reviews, companies 
can identify areas of improvement, address customer 
concerns, and tailor their offerings to meet customer 
needs effectively. In the competitive landscape of 
online shopping, accurate sentiment analysis is 
crucial for informed decision-making. Algorithms 
with superior predictive accuracy, like BO-RF, 
empower businesses to make data-driven decisions 


regarding product development, marketing 
strategies, and customer engagement initiatives, 
ultimately enhancing competitiveness and 


profitability. Understanding customer sentiments 
allows businesses to proactively address issues, 
resolve complaints, and improve the shopping 
experience. By leveraging advanced sentiment 
analysis algorithms, companies can identify positive 
and negative sentiments in real time, enabling them 
to respond promptly to customer feedback and 
strengthen customer relationships. Sentiment 
analysis algorithms can also detect fraudulent 
activities, such as fake reviews or product listings, in 
online shopping platforms. By accurately classifying 
sentiments expressed in reviews, algorithms can flag 
suspicious activities, helping businesses maintain 
trust and integrity in their online platforms. 
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Figure 3's insights into classification 
accuracy and pprecision-recall balance have 
significant implications for sentiment analysis in 
online shopping. By leveraging high-performing 
algorithms like BO-RF, businesses can gain valuable 
insights, improve decision-making processes, 
enhance customer experiences, and mitigate risks 
associated with fraudulent activities, ultimately 
driving success in the dynamic landscape of online 
retail. 


6. CONCLUSION 


This research has presented Bayesian 
Optimization-inspired Random Forest (BO-RF) as a 
potent tool for refining sentiment analysis in online 
shopping. The study identified challenges inherent in 
sentiment analysis, particularly in effectively 
processing vast amounts of unstructured text data 
while maintaining accuracy amidst noise and 
contextual complexities. The proposed approach 
introduces BO-RF as a novel solution, leveraging the 
synergy between Bayesian Optimization and 
Random Forest algorithms to overcome these 
challenges. Through meticulous experimentation 
and evaluation, BO-RF demonstrated superior 
performance to conventional methods, showcasing 
heightened accuracy and efficiency in sentiment 
classification tasks. Implementing BO-RF signifies 
a significant stride forward in sentiment analysis 
methodologies, offering businesses enhanced 
capabilities to decipher customer sentiments 
expressed in online reviews and feedback. By 
leveraging the insights gleaned from sentiment 
analysis, companies can make more informed 
decisions, refine product offerings, and ultimately 
elevate the overall shopping experience for 
customers. Moving ahead, continued research 
endeavours could explore the integration of BO-RF 
in diverse applications beyond online shopping, 
fostering innovation and advancement in sentiment 
analysis methodologies. Moreover, ongoing 
refinement and exploration of BO-RF's capabilities 
promise to enhance sentiment analysis accuracy and 
efficiency further, propelling the field towards 
continuous evolution and improvement. 
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